NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

cache_ext: Customizing the Page Cache with eBPF

https://doi.org/10.1145/3731569.3764820

Zussman, Tal; Zarkadas, Ioannis; Carin, Jeremy; Cheng, Andrew; Franke, Hubertus; Pfefferle, Jonas; Cidon, Asaf (October 2025, ACM)

The OS page cache is central to the performance of many applications, by reducing excessive accesses to storage. However, its one-size-fits-all eviction policy performs poorly in many workloads. While the systems community has experimented with a plethora of new and adaptive eviction policies in non-OS settings (e.g., key-value stores, CDNs), it is very difficult to implement such policies in the page cache, due to the complexity of modifying kernel code. To address these shortcomings, we design a flexible eBPF-based framework for the Linux page cache, called cache_ext, that allows developers to customize the page cache without modifying the kernel. cache_ext enables applications to customize the page cache policy for their specific needs, while also ensuring that different applications’ policies do not interfere with each other and preserving the page cache’s ability to share memory across different processes. We demonstrate the flexibility of cache_ext’s interface by using it to implement eight different policies, including sophisticated eviction algorithms. Our evaluation shows that it is indeed beneficial for applications to customize the page cache to match their workloads’ unique properties, and that they can achieve up to 70% higher throughput and 58% lower tail latency.
more » « less
Free, publicly-accessible full text available October 12, 2026
Fusion: An Analytics Object Store Optimized for Query Pushdown

Lu, Jianan; Raina, Ashwini; Cidon, Asaf; Freedman, Michael J (March 2025, ASPLOS '25: Proceedings of the 30th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

The prevalence of disaggregated storage in public clouds has led to increased latency in modern OLAP cloud databases, particularly when handling ad-hoc and highly-selective queries on large objects. To address this, cloud databases have adopted computation pushdown, executing query predicates closer to the storage layer. However, existing pushdown solutions are ine!cient in erasure-coded storage. Cloud storage employs erasure coding that partitions analytics file objects into fixed-sized blocks and distributes them across storage nodes. Consequently, when a speci"c part of the object is queried, the storage system must reassemble the object across nodes, incurring significant network latency. In this work, we present Fusion, an object store for analytics that is optimized for query pushdown on erasure-coded data. It co-designs its erasure coding and file placement topologies, taking into account popular analytics file formats (e.g., Parquet). Fusion employs a novel stripe construction algorithm that prevents fragmentation of computable units within an object, and minimizes storage overhead during erasure coding. Compared to existing erasure-coded stores, Fusion improves median and tail latency by 64% and 81%, respectively, on TPC-H, and up to 40% and 48% respectively, on real-world SQL queries. Fusion achieves this while incurring a modest 1.2% storage overhead compared to the optimal.
more » « less
Free, publicly-accessible full text available March 30, 2026
Characterizing the Networks Sending Enterprise Phishing Emails

https://doi.org/10.1007/978-3-031-85960-1_18

Luo, Elisa; Young, Liane; Ho, Grant; Afifi, M H; Schweighauser, Marco; Katz-Bassett, Ethan; Cidon, Asaf (January 2025, Springer Nature Switzerland)

Full Text Available
Managing Memory Tiers with CXL in Virtualized Environments

Zhong, Yuhong; Berger, Daniel S; Agarwal, Ishwar; Agarwal, Rajat; Hady, Frank; Waldspurger, Carl; Wee, Ryan; Kumar, Karthik; Hill, Mark D; Chowdhury, Mosharaf; et al (July 2024, USENIX OSDI)

Full Text Available
Treehouse: A Case For Carbon-Aware Datacenter Software

https://doi.org/10.1145/3630614.3630626

Anderson, Thomas; Belay, Adam; Chowdhury, Mosharaf; Cidon, Asaf; Zhang, Irene (October 2023, ACM SIGEnergy Energy Informatics Review)

The end of Dennard scaling and the slowing of Moore's Law has put the energy use of datacenters on an unsustainable path. Datacenters are already a significant fraction of worldwide electricity use, with application demand scaling at a rapid rate. We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach: by making energy and carbon visible to application developers on a fine-grained basis, by modifying system APIs to make it possible to make informed trade offs between performance and carbon emissions, and by raising the level of application programming to allow for flexible use of more energy efficient means of compute and storage. We also lay out a research agenda for systems software to reduce the carbon footprint of datacenter computing.
more » « less
Full Text Available
Karma: Resource Allocation for Dynamic Demands

Vuppalapati, Midhul; Fikioris, Giannis; Agarwal, Rachit; Cidon, Asaf; Khandelwal, Anurag; Tardos, Eva (July 2023, USENIX Symposium on Operating Systems Design and Implementation)

Full Text Available
Efficient Compactions between Storage Tiers with PrismDB

https://doi.org/10.1145/3582016.3582052

Raina, Ashwini; Lu, Jianan; Cidon, Asaf; Freedman, Michael J. (March 2023, ASPLOS 2023: Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems)

In recent years, emerging storage hardware technologies have focused on divergent goals: better performance or lower cost-per-bit. Correspondingly, data systems that employ these technologies are typically optimized either to be fast (but expensive) or cheap (but slow). We take a different approach: by architecting a storage engine to natively utilize two tiers of fast and low-cost storage technologies, we can achieve a Pareto efficient balance between performance and cost-per-bit. This paper presents the design and implementation of PrismDB, a novel key-value store that exploits two extreme ends of the spectrum of modern NVMe storage technologies (3D XPoint and QLC NAND) simultaneously. Our key contribution is how to efficiently migrate and compact data between two different storage tiers. Inspired by the classic cost-benefit analysis of log cleaning, we develop a new algorithm for multi-tiered storage compaction that balances the benefit of reclaiming space for hot objects in fast storage with the cost of compaction I/O in slow storage. Compared to the standard use of RocksDB on flash in datacenters today, PrismDB’s average throughput on tiered storage is 3.3x faster, its read tail latency is 2x better, and it is 5x more durable using equivalently-priced hardware.
more » « less
Full Text Available
Memtrade: Marketplace for Disaggregated Memory Clouds

https://doi.org/10.1145/3589985

Maruf, Hasan Al; Zhong, Yuhong; Wang, Hongyi; Chowdhury, Mosharaf; Cidon, Asaf; Waldspurger, Carl (May 2023, Proceedings of the ACM on Measurement and Analysis of Computing Systems)

We present Memtrade, the first practical marketplace for disaggregated memory clouds. Clouds introduce a set of unique challenges for resource disaggregation across different tenants, including resource harvesting, isolation, and matching. Memtrade allows producer virtual machines (VMs) to lease both their unallocated memory and allocated-but-idle application memory to remote consumer VMs for a limited period of time. Memtrade does not require any modifications to host-level system software or support from the cloud provider. It harvests producer memory using an application-aware control loop to form a distributed transient remote memory pool with minimal performance impact; it employs a broker to match producers with consumers while satisfying performance constraints; and it exposes the matched memory to consumers through different abstractions. As a proof of concept, we propose two such memory access interfaces for Memtrade consumers -- a transient KV cache for specified applications and a swap interface that is application-transparent. Our evaluation using real-world cluster traces shows that Memtrade provides significant performance benefit for consumers (improving average read latency up to 2.8X) while preserving confidentiality and integrity, with little impact on producer applications (degrading performance by less than 2.1%).
more » « less
Full Text Available
BPF-oF: Storage Function Pushdown Over the Network

Zarkadas, Ioannis; Zussman, Tal; Carin, Jeremy; Jiang, Sheng; Zhong, Yuhong; Pfefferle, Jonas; Franke, Hubertus; Yang, Junfeng; Kaffes, Kostis; Stutsman, Ryan; et al (September 2023, Arxiv)

Full Text Available
Treehouse: A Case For Carbon-Aware Datacenter Software

Anderson, Thomas; Belay, Adam; Chowdhury, Mosharaf; Cidon, Asaf; Zhang, Irene (July 2022, HotCarbon: 1st Workshop on Sustainable Computer Systems Design and Implementation)

The end of Dennard scaling and the slowing of Moore’s Law has put the energy use of datacenters on an unsustainable path. Datacenters are already a significant fraction of worldwide electricity use, with application demand scaling at a rapid rate. We argue that substantial reductions in the carbon intensity of datacenter computing are possible with a software-centric approach: by making energy and carbon visible to application developers on a fine-grained basis, by modifying system APIs to make it possible to make informed trade offs between performance and carbon emissions, and by raising the level of application programming to allow for flexible use of more energy efficient means of compute and storage.We also lay out a research agenda for systems software to reduce the carbon footprint of datacenter computing.
more » « less
Full Text Available

« Prev Next »

Search for: All records